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@ Special Instruction processing unit for data processing system, 

@ A special instruction processor, such as a floating point 
accelerator processor, that processes a special class of instruc- 
tions. Each instruction identifies the number of operands to be 
processed as well as the number of data words required to be 
transfen'ed for each operand. The central processing unit 
retrieves and decodes each instruction^ and transfers the 
special instructions to the special instruction processor. Both 
processors decode the special instructions to detennine the 
numbers of operands and data words to be transferred. The 
central processing unit retrieves the data words from memory 
and transmits them to the special instruction processor. The 
special instruction processor processes the Instruction and 
signals the central processor when finished. The central 
processor then causes the special instmction processor to 
transmit the processed data back to the central instruction 
processor. 
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SPECIAL INgTRUCTION PROCESSING UNIT FOR DATA PROC ESSING SYglEM 
Background of the Invention 

Field of the Invention 

This invention relates to electronic digital data 

processing systems incorporating a special instruction 

5 processor such as a floating point accelerator processor. 

More specifically/ the invention relates to a new and 

improved interconnection arrangement between the floating 

point processor and a central processor unit. 

Description of the Prior Art 

IQ A digital data processing system comprises three 

elements: namely, a memory element, an input/output element, 
and a processor element. The memory element stores 
information in addressable storage locations, with each 
location having a unique address. The information stored in 

15the memory includes data, or "operands", and instructions for 
processing the operands. The processor element transfers 
information to and from the memory element, interprets the 
information as either instructions or operands and processes 
the operands in accordance with the associated instructions. 

20The input/output element, under control of the processor 
unit, also communicates with the memory element in order to 
transfer operands and instructions into the system and obtain 
processed data from it. 

Operaiias processed by a processing unit may take a 

25number of forms. In many processing operations, operands are 
in the form of integers or whole numbers. In other 
operations, operands are in a floating point format, that is, 
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in what is typically known as "scientific notation", m this 
form, an operand has two parts, including an exponent and a 
mantissa, m a binary-based system used in most computers, 
the mantissa is a fraction with a binary point to the 
5 immediate left of the most significant digit, and the 
exponent represents the power of two to which the mantissa 
must be taken to obtain the value of the number. Each of the 
mantissa and the exponent may contain a sign, either positive 
or negative. 

10 Similarly, operands for trigonometric functions may have 

a different form from both integer and floating point 
operands. For instance, some trigonometric functions may be 
repetitive for , each multiple of three hundred and sixty 
degrees, if the operands are expressed in degrees, or may 

15represent or require specific operations with respect to -pi" 
if the operands are expressed in radian notation. In 
addition to the scientific notation or special tr igononjetr ic 
function, other special operand notations are conceivable. 
Each of these special classes of operands may require 

20special handling by the processor. With respect particularly 
to floating point instructions, a number of arrangements for 
processing such operands have been attempted in the prior 
art. In some data processing systems, the central processing 
unit contains special control circuitry for executing the 

255pecial instructions. Even with this hardware approach, 
floating point instructions usually require significantly 
more time to complete than do instructions on conventional 
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integer operands. As a central processor unit executes all 
instructions^ including the floating point instructions in 
seriatum, the addition of floating point instructions can 
significantly increase the overall time to complete a given 
5program. 

In another approach, the operands are processed using 
subroutines comprising sets of the machine instructions to 
implement the floating point functions. The central 
processor unit merely uses a floating point instruction as an 

lOinstruction directing the processor to execute the 
appropriate subroutine. This approach enables the functions 
performed by the special class of instructions to be altered 
relatively simply. However, this approach is considerably 
slower than the hardware approach, as the mantissas and 

15exponents of the operands must be handled separately. 

In other prior data processing systems, separate 
functional modules operating in parallel execute the 
instructions of the data processing system. In some cases, 
each module can execute all of the instructions on the 

20different classes of operands, or specific modules may be 
assigned to process instructions on classes of operands. In 
either case, each module operates independently by retrieving 
data from or storing data in the memory unit directly. A 
controlling module may retrieve the instructions in seriatum 

25and transfer it to either an idle module, or to the module 
designed to execute that operation. 
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Since the modules operate in parallel, they n.ay operate 
siniultaneously, within the constraint that normally one 
instruction cannot be executed until a previous instruction 
has been executed. While the time required to perform a 
5 sxngle operation is about the same as in the prior hardware 
approach, the parallel nature of the module significantly 
reduces the time to execute a program contained in the 
special instructions, as a free module may be used to process 
interrupts. However, as each module must be capable of 
10 operating independently, circuit redundancy is necessary. 
Each module usually performs only one functxon, and is not 
readily converted into other functions. 

in a fourth arrangement, used on conjunction with 
certain PDP-ll data processing systems, a separate module, 
15 termed a -floating point accelerator", processes instructions 
:n conjunction with floating point operands. The instructx.n 
indicates whether the operand is floating point or integer, 
and xf the instruction indicates that the operand is floating 
poxnt, the central processor unit passes the instruction to 
20 the floating point accelerator. The floating point 

accelerator then decodes the instruction, and requests the 
central processing unit to retrieve operands from the memory 
unit for memory, and to store the processed data in the 
memory. While this approach allows the central processing 
25 unit the freedom to process such thxngs as interrupts while 
the floating point accelerator is in the process of executing 
the floating point instruction, the required interaction 
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between the central processing unit and the floating point 
accelerator unnecessarily complicates both units. 

It is therefore an object of this invention to provide a 
new and improved floating point accelerator. 
5 It is yet another object of this invention to provide a 

new interconnection arrangement between the floating point 
accelerator and the central processing unit. 

Summary 

2^0 In brieff this invention provides a new and improved 

interconnection arrangement between the central processing 
unit and a processing module for processing a special class 
of instructions that operate on a uniquely defined class of 
operands. The central processing unit retrieves all of the 

15 instructions, in series, in a conventional manner, and 
decodes the instructions. An image of each instruction is 
passed to the special instruction processor. When an 
instruction is received which requires processing of one of 
the special class of operands, the central processing unit 

2othen retrieves the data words comprising the operands from 
the memory and passes them to the special instruction 
processor. 

After receiving the instruction the special instruction 
processor also decodes the instruction and proceeds to 
25re.ceive the data words comprising the operands. The special 
instruction processor then processes the operands in a 
conventional manner, and prepares to transmit the results of 
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processing, namely the processed data and the condition 
codes, back to the central processing unit. When the central 
processing unit is signalled by the special instruction 
processor that it has finished processing, it signals the 
5special instruction processor to transmit the data. The 
special instruction processor then transmits the data and the 
condition codes. The central processing unit then can 
transmit the processed data bade into storage in the memory. 
In addition, the invention provides certain maintenance 
loand diagnostic features in which the central processing unit 
can force the special instruction processor to a selected 
state, and it can also determine the state of the special 
instruction processor. 

ISBrief Descripfcinn pf the Drawin^ g 

This invention is pointed out with particularity in the 
appended claims. The above and further objects and 
advantages of this invention may be better Understood by 
referring to the following description taken in conjunction 
2CKith the accompanying drawings in which: 

FIG. 1 is a block diagram of a digital data processing 
system constructed in accordance with this invention; 

PIG. 2 is a detailed block diagram of a portion of the 
central processing unit shown in FIG. 1; 
25 .. FIG. 3 is a detailed block diagram of a special 
instruction processing unit shown in FIG. 1, specifically a 
floating point accelerating unit; 
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FIG. 4 depicts a detailed circuitry in a portion of the 
control block shown in FIG. 3; 

FIG. 5 — ____ :-is a flow diagram 

depicting transfers between the central processing unit and 
Sspecial instruction processing unit shown in FIG. It 

Description of an Illustrative Embodiment 
I. General Description 

10 A. Data Processing System 

Referring to PIG. 1, the basic elements of a data 
processing system that embody this invention comprise a 
central processor unit 10, memory unit 11, and input/output 
units 12, which include terminals 13. The central processor 

ISunit communicates directly with certain of the input/output 
units 12 over an accelerator bus 14. The central processor 
unit IV communicates with memory unit 11 over a memory bus 
15, and the memory unit in turn communicates directly with 
others of input/output units 12 over an input/output bus 16. 

20The central processor unit Ijy communicates with terminals 13 
over a terminal bus 17. 

The central processor unit comprises a data processor 
2ff, and control store 21 which are connected to memory bus 
15, and a console processor 22. The console processor 

25 receives signals from terminal bus 17, and transfers them 
through control store 21 to data processor 20r. Data 
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processor Iff then operates on the information from console 
processor 22 and may transfer the information to the memory 
unit 11 for future processing, or it may process information 
directly. Similarly, data processor 2pr may transfer 
5 information through control store 21 to the console processor 
22, which may then transmit the information to terminal bus 
17 for transfer to one of terminals 13. The data processor 
also performs all communications over the accelerator bus 14 
with those input/output units 12. connected thereto. The 
10 communications with input/output units 12 over accelerator 
bus 14 are described in copending U.S. Patent Application 

Serial No. (Attorney's Docket 83-277), filed . 

As described below, the data path communicates directly 
with the memory unit 11 over memory bus 15, and indirectly 
ISwith the input/output bus 16 through memory unit 11. 
The control store 21 contains all of the 
microinstruction sequences that are used for processing the 
instructions that are received and executed by data processor 
2pr, and steps through the microinstruction sequences based on 
SOsequencing information from the data processor and timing 
information from a timing signal generator which it 
maintains . 

Memory unit 11 contains a memory controller 3? having 
one connection, or port, to memory bus 15, and a second 
25connected to input/output bus 16. One or more memory arrays 
31 connect to memory controller 31? and contain the 
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addressable memory storage locations that may be accessed 
directly by the memory controller. 

In addition to central processor unit 10, a floating 
point accelerator processor 23 may be connected to 
5accelerator bus 14, A floating point accelerator processor 
23 useful in the data processing system of FIG, 1, and the 
accelerator bus 14 are described herein. Floating point 
accelerator processor 23 receives floating point instructions 
from data processor Iff and is specially designed to process 
IQSuch instructions generally more rapidly than data processor 
Iff would normally be able to. 

Several types of input/output units 12 are shown in 
FIG. 1. A. communications adapter 4jJ can connect to 
synchronous and/or asynchronous data communications lines to 
15transfer information over, for example, conventional 
telephone lines, or to enable connection of the data 
processing system as one element in a local distributed 
processing network. Specific signals for the synchronous and 
asynchronous connection to communications adapter are not 
20shown; however, such signals would depend on the particular 
signal protocols used in such transmission, and are not a 
part of this invention. The communications adapter 4jr 
normally would include circuitry for buffering information 
during the synchronous or asynchronous transfers, and for 
ZSg.eperating control signals over the synchronous and 

asynchronous communications paths to enable the information 
to be transferred. The communications adapter Af! also 
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contains circuitry for transferring, information over 
input/output -.bus/16i^ Since 'the communications' adapter forms 
no part of this invention, it will not be described further 
herein.-;, .i: r : . 

5 Three other input/output Units 12 '-provide a secondary " 

storag€:vfacility for the data :pr6cessing system; They ' 
include; a tape controller ^l connected to 'a tape drive 42, 
and two disc- controllers 43 and- 44. * Disc controller 43 is • 
connected to a plurality of disto 'drives 45/ 'while disc ' 

locontroiier: r44. may .be connected to a^- disc driv^ •46''^and' to V 
P}^^^ of: ^isc drives^ ^Af, Disc contVoYlet'^^^'ls^ wWcVed'- 
to accelerator bus 14, and is describe'd In the^ Vforemfehtibned 
copending U.S.- Patent Application Serial' ifo;^ ' ' • " • ' 
(Attorney's Docket Ko. 83-277)/ filed '- •■ Units 41 and 

1543; and. their, respective storage elements may be constructed 

as described inr.U.S.r.PatentrNo. 3,999,163.: • ; i • • 

In one:.specific embodimertt of the data 'processihg" system 
of PIG;.1, .the = input/output bus is constructed' in accordance 
with U.S. Patent No. 3, 71?, 324, which describes in detail the 

SOsignals- required to transfer information thereover , These • ' 
signals are only briefly described herein, and reference 
should be made to that patent for a detailed explanation. 

Terminals 13 may include a tape drive 5?r,- or a system 
console 51, which are directly connected to terminal bus 17. 

25An optional remote console 52 may be provided to transfer 
signals with. terminal bus 17 over telephone lines through 
conventional modems (not shown); The remote console 52 can' 
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be used for remote diagnosis of system failures or for remote 
maintenance. The tape drive 50 may be usec3 for local 
maintenance or for transferring information into or out of 
the system. The system console may be used to provide direct 
5 operator control of the system, and may permit the operator 
to turn the system on or off, to initialize the system, and 
to step through a program sequence step-by-step . 

Before proceeding further, it may be useful to establish 
some definitions for terms that have already been used and 
IQwill be used throughout the remainder of this description. 

"Information" is intelligence that controls and provides 
the basis for data processing. It includes address, data, 
control and status 'information . 

"Data" includes information which is the object of or 
ISresult of processing. 

"Address" information identifies a particular storage 
location in which other information, such as data 
information, control or status information or other address 
information, is stored. 

20 "Control" information identifies particular operations 
to be performed. It includes commands between units of a 
data processing system that certain operations be performed, 
instructions to be performed by the central processor Iff or 
floating point accelerator processor 23, and it also includes 

25*information that modifies a unit's performance of an 
operation or execution of an instruction so as to enable 
certain actions to occur or disable actions from occurring • 
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An ••instruction"' is a step in a program that is executed 
by the central processor unit IpT or floating point 
accelerator processor 23. Each step may be executed by the 
respective processor executing one or more microinstructions. 
5 Each microinstruction is stored in a specific location, which 
is identified as a micro-address. Other units, for example, 
memory controller 3ff, also perform operations in response to 
and as defined in sequences of microinstructions. 

••Status" information identifies the condition of various 
10 signals generated by a unit at various times during the 
processing of an operation or execution of an instruction. 
B. Central Processor Unit Ipr 

PIG. 2 illustrates, in general block diagram form, 
portions of central processor Iff, including data processor 20 

15 and control store 21^ that may be useful in the data 
processing system of FIG. 1. 

Data processor Iff includes a data path 60 that includes 
an arithmatic logic unit and a plurality of general purpose 
registers (not shown). In one specific embodiment of this 

20 invention, one of the general purpose registers is used as a 
program counter to identify the storage location in memory 
containing the next instruction to be executed by the 
processor Iff and another register is used as a stack pointer 
used during the servicing of interrupts and subroutines, as 

25described in 0.5. Patent No. 3,71jy,324. The data path 6? 
receives information from, or transfers information to, the 
accelerator bus 14, the memory bus 15, or from a plurality of 
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console registers 61 that in turn receive add store 
information fronir or transfer information to, console 
processor 22 over a console bus 62, 

Operations performed by data path 6!? are controlled by 
5 instructions stored in an instruction buffer 63, which 
receives each instruction fetched from memory unit 11 
identified by the program counter register in data path 60r. 
Alternatively, the operations performed by data path 60" can 
be controlled by an interrupt processor 64 which receives 

10 requests for interrupt service from accelerator bus 14, 
console bus 62 (through console registers 61} and from the 
input/output bus 16. The interrupt processor 64 also 
receives the interrupt priority level at which the processor 
Ijy is then operatirig and, if the interrupt request has a 

15higher priority, acknowledges the interrupt and causes the 
processor Iff to service the interrupt request* A 
microsequencer 65 generates a microaddress that is used by a 
micro-control store 66 in control store 21 to access a 
microinstruction depending on the instructions stored in 

20instruction buffer 63, or the interrupt being serviced by 
interrupt processor 64. The microsequencer 65 generates the 
microaddress in response to the particular instruction in 
instruction buffer 63 then being processed, and the 
acknowledgement of an interrupt by interrupt processor 64, as 

25well as timing signals generated by a clock generator 67 in 
control store 21. 
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C. Floating Point Accelerator 21 

A general block diagram of a flo.ating point accelerator 
23 is Shown in FIG. 3. A floating point instruction as 
transferred to the floating point accelerator by the central 
5 processing unit ipr, specifically by the data path 2,y, is 
received in an instruction decoder one specific 

embodiment of floating point accelerator 23 is 
microprogrammed. The instruction decoder is thus connected 
to a microaddress sequencer Iff! in a conventional manner 
lOwhich in turn supplies microaddresses to a control store 103. 
The control store contains microinstructions that control the 
operations of floating point accelerator 23, including a data 
path l^,. The 'data path contains operand storage registers 
and an arithmetic and logic unit that processes the operands 
ISthat are received from the central processor unit 1, through 
a transceiver buffer iprs. 

The microaddress sequencer 102, using conventional • 
microinstruction addressing techniques, supplies a -next 
Microaddress" to the control store 1^3. The microaddress 
20sequencer W also receives signals from the control store 
172 and from a branch logic l(r6, and generates the next 
microaddress that is then transmitted to the control store. 
The control store thus generates microinstructions that 
control the data path logic l(ar4 based on a predetermined 
25mi.croinstruction sequence. The specific sequence is 

determined by the instruction from central processor unit 10, 
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as modified by certain conditions in the floating point 
accelerator 23 as reflected by branch logic 106. 

Data path logic 1)74 also uses conventional data paths to 
process floating point instructions. The data path contains 
56eparate data paths for the exponent and for the mantissa r 
and operates on each according to the microinstructions from 
control store 1|?3. In one specific embodiment r the data path 
logic comprises conventional AWD 29j71 bit slice 
microprocessors sold by Advanced Micro Devices, Inc. 

10 Floating point accelerator 23 also includes interface 
control circuitry Iff! that also receives signals from the 
central processing unit and assists in synchronizing the 
transfer of operands and processed data between the central 
processor unit Iff and floating point accelerator 23. A 

15detailed description of circuitry in control circuitry 1U7 is 
presented in FIG. 4. 

In accordance with the invention, the central processing 
unit serially retrieves each instruction from the memory unit 
11. Each instruction is loaded into instruction buffer 63 

20(FIG. 2). The instruction buffer and the associated 
processor then decode the instruction. If the instruction is 
a floating point instruction, that is, an instruction that 
indicates that the operands are in floating point format, it 
transmits the instruction to the floating point accelerator 

25over the BUS IB D(7:0) 1(78 (FIG 3.), simultaneously asserting 
a synchronizing IRD STATE instruction read signal 1179- 
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The instruction processor 63 (PIG. 2) in central 
processor unit Iff further decodes the instruction and 
proceeds to retrieve the operands from the memory unit 11. 
Each instruction may require one or more operands, and an 
5 operand may comprise one or more data words in the memory 
depending on the required degree of arithmetic precision, 
each data word requiring a memory retrieval. Each data word 
is transferred through the ALU epr (fig. 2) of the central 
processor unit IJjr, and transmitted over BUS Y D(31:0jr) 
10 (PIG 3), the data section of accelerator bus 14. Each 
transfer over BOS Y (D31:0pf) 11(7 is synchronized to the 
assertion of a CPU DATA AVAIL CPU data available signal 111. 
The floating point accelerator 23 receives each data 
_ word, in buffer 105, and transfers it onto a BUS FPA floating 
ISpoint accelerator bus. The control store 103 determines 
which registers of data path logic 104 each transferred data 
word is to be stored in, depending on whether the word forms 
part of the mantissa or the exponent. The instruction 
previously transferred into instruction decoder 101 also 
20identifies the number of operands and the number of data 
words, required to process the instruction. Control store 
103 causes the data words to be transferred through buffer 
105 and into registers in data path logic 104 in accordance 
with the number of words to be received. After all the 
25operands have been received, the data path logic 104, under 
control of control store 103, executes the instruction in a 
conventional manner . 
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After the instruction has been executed, the control 
store l?3 transmits an ACC SYNC accelerator synchronization 
signal over line 12J? (FIG. 3) to central processor unit 1!?. 
This signal is a signal in the microinstruction generated by 
5 control store icr3 following execution of each instruction. 
The central processing unit Iff receives the signal and 
services it as an interrupt service request- When it is 
ready to receive the processed data from the floating point 
accelerator, it asserts a READ PORT signal on a line 121 and 

10 a SEL ACC IN select accelerator in signal on line 122. Since 
several units may be connected to accelerator bus 14, the SEL 
ACC IN select accelerator in signal serves to enable the 
floating point accelerator 23 to transmit on the BUS Y 
D(31;017) llff and prevents other attached units from 

15 transmitting data over those lines. In successive clock 
cycles, the floating point accelerator 23 transmits the 
condition codes and data words comprising the processed data 
from data path logic lfr4 across BUS FPA 112 through buffer 
Ijys and across BUS Y D(31:0pf) 119- The central processor 

20 unit Iff, specifically data path 2fl, receives the condition 
codas and data words and transmits them to the memory unit 
11. 

The arrangement according to the invention provides a 
much less complex interconnection arrangement between central 
25 processor unit Iff and a floating point accelerator 23. The 
special instruction, in one specific embodiment of a floating 
point instruction, is transmitted by the central processing 
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unit ler to floating point accelerator processor 23, and the 
operands are then automatically and sequentially retrieved by 
data path 29f and transmitted directly to floating point 
accelerator 23 with no further interaction between them. 
5When the condition codes and processed data are ready, the 
floating point accelerator signals the central processing 
unit. When the central processing unit is ready to receive 
the processed data, it signals the floating point 
accelerator, which responds with successive data words. Both 
lOthe central processor unit and floating point accelerator 
determine the number of data words to be transferred in both 
directions, as operands and processed data, based on the 
contents of the instructions. Therefore, the only 
communication back and forth as to the number of data words 
15to be transferred is by way of the instruction. This 
simplifies the floating point accelerator and the 
communications therebetween. 

The central processing unit 10 and floating point 
accelerator also provide maintenance and diagnosing features 
2Qin which the central processor unit 10r „ay transmit a "next 
microaddress- to the floating point accelerator control store 
lff2 to place the control store in a specific and known 
address condition. Furthermore, the central processor unit 
may read, or retrieve, the "next microaddress- that is 
a^upplied by microaddress sequencer Iff! to control store 1^3. 

The operation by which the central processor unit forces 
the control store l|3r3 to a known address condition will be 



3NSOOCIO:<£P 0092429A2> 



0092ij23.. 

- 19 - 

described in connection with FIG. 3. The central processor 
unit transmits a microaddress over BUS Y 0(31:0?) lljr, and 
asserts a TRAP ACC trap accelerator signal on line 123. The 
TRAP ACC trap accelerator signal is received in control 
jcircuitry IJ??, which inhibits the microaddress sequencer 1(J2 
from transmitting a next microaddress onto the control store, 
and enables the signals on the BUS Y D(31:0pr) to be 
transmitted through buffer 1(J5 and to the "next microaddress*' 
input to control store 1)?3. The control store 103 then 

IQtransmits the microinstruction identified by the supplied 
signals, forcing a shift to the sequence defined by the 
address transmitted on BUS Y (031:00) 110. 

The operation in which the central processor unit 10 
retrieves the microaddress transmitted by microaddress 

15sequencer 102 will be explained in connection with FIG. 3. 
The central processor unit first transmits a REAO ACC UPC 
read accelerator microprogram . counter signal on line 124 to 
control circuitry 107. This conditions the floating point 
accelerator 23 to transfer the next microaddress generated by 

20microaddress sequencer 102 to transceiver 132 in buffer 105. 
During the next succeeding clock cycle, the central processor 
unit asserts the READ PORT signal on line 121. At that time, 
the "next microaddress" is transmitted through buffer 105 
onto BUS Y D(31:00) in synchronism with the control store's 

25assertion of the ACC SYNC accelerator sync signal on line 120 
from control store 103. At the end of the clock cycle, the 
"next microaddress" is removed from the BUS Y 0(31:00) 110 
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and the floating point accelerator 23 returns to normal 
operation, 

PIG. 4 depicts certain circuitry in control 1^7 that 
enables the floating point accelerator 23 to receive the READ 
5P0RT, SEL ACC IN select accelerator in signal, READ ACC UPC 
read accelerator microprogram counter signal, and trap acc 
trap accelerator signal and enables it to transmit and 
receive the associated signals over BUS Y D(3l:0jy). 

The READ PORT and SEL ACC IN select accelerator in 
lOsignals are received in the control circuit ljr7 and coupled 
to an AND gate 131, The coincidence of these signals" causes 
the assertion of an CPU RCV DATA central processing unit 
receive data signal which is coupled to branch logic 1^6, 
This causes the miccoaddress sequencer W to shift control 
ISstore m to a sequence that causes the floating point 
accelerator 23 to transmit processed data and the condition 
codes to central processor unit Iff. This signal also 
conditions a transceiver 132 in buffer 1?5 between BOS PPA 
112 and BUS Y D(31:0(r, lig to transmit data from floating 
20point accelerator 23 to central processor unit Iff. 

The CPU RCV DATA central processing unit receive data 
signal also energizes OR gates 134, 135 and one input to AND 
gate 136, which is also energized by a CPU PH0 clocking 
signal from central processor unit 10r, and an ALLOW CPU Y BUS 
25signal from control store l!f2 to enable transceiver 132. 

When the TRAP ACC trap accelerator signal is asserted, 
neither the READ PORT nor the SEL ACC IN select accelerator 
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in signals are asserted. Therefore, the CPU RCV DATA centre 
processor unit receive data signal is not asserted. 
Transceiver 132 is thus conditioned to transfer signals from 
BUS y D(31:jJJ3r) lift to BUS FPA 112 through buffer Ijys, The 
Stransceiver is enabled by the TRAP ACC trap accelerator 
signal through OR gates 134 and 135, and AND gate 136 when 
the CPU PH0 timing signal and ALLOW CPU Y BUS signal from 

control store lff3 are asserted. 

The TRAP ACC trap accelerator signal also causes a flip- 

lOflop 14pr to be set through AND gate 141 at the coincidence of 
the CPU PH^f and CLK OFF timing signals. The setting of flip- 
flop 14!? causes a FORCE UADRS force microaddress signal to be 
asserted. As shown in FIG. 3, this signal is transmitted 
from control Iffl to microaddress sequencer 11?2. When the 

15F0RCE UADRS force microaddress signal is asserted, the 
address from microaddress sequencer 1J?2 is not transmitted to 
control store 1(73. The FORCE UADRS force microaddress signal 
enables a transceiver 141 to transfer a next microaddress 
from the BUS FPA 112 to control store igr3. The transceivers 

20141 are enabled through an OR gate 143. The flip-flop 14Cr is 
reset at the assertion of the next GPU PH J? timing signal but 
through an AND gate 144. 

The control 19fl receives the READ ACC UPC read 
accelerator microprogram counter signal in a flip-flop 15?, 

25which generates a READ UADRS read microaddress signal • This 
READ UADRS read microaddress signal is coupled through OR 
gate 143 to enable transceiver 142. However, since the FORCE 
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OADRS force roicroaddcess signal is not asserted, the 
transceiver 14 2 is conditioned to transmit the next 
microaddress signal from microaddress sequencer 1^2 onto BUS 
FPA 112. The flip-flop 15jy is then reset through AND gate 
5 151 at the next CPU PH0 clocking signal. 

FIG-. 5 depicts detailed sequences of the operations of 
transferring instructions, operands, and processed data, as 
well as microaddresses between CPU Ipr and floating point 
accelerator 23. The sequences depicted in FIG. 5 are self- 
10 explanatory and a detailed description of these operations is 
not required here for an understanding of the invention. 

The foregoing description is limited to a specific 
embodiment of this, invention. It will be apparent, however, 
that this invention can be practiced in systems having 
15 diverse basic construction or in systems that use different 
internal circuitry than is described in this specification 
with the attainment of some or all of the foregoing objects 
and advantages of this invention. It is the object of the 
appended claims to cover all such variations and 
20 modifications as come within the true spirit and scope of 
this invention. 



SNSOOCIO:<EP 0092429A2> 



0092^29 ' 

CLAIMS: - 23 - 

1. A special instruction processor for processing a 
predetermined class of instructions on a predetermined class 
of operands, the special instruction processor for connection 
to a central processing unit that retrieves instructions and 
5 data from a memory and for transmitting processed data to a 
meraoryr the instructions in said predetermined class 
identifying the number of operands and the number of words 
required to process the instruction ^ the special instruction 
processor comprising : 
10 A. means for receiving instructions of said predetermined 
class transmitted from the central processing unit and for 
decoding the instructions to determine the numbers of 
operands and words required to be processed; 

B. means responsive to said decoding means and to a signal 
15 from the central processing unit for receiving in successive 

cycles; 

C. means responsive to said decoding means for processing 
the operands and for signalling the central processor unit 
when the processing has finished; and 

20 D. means responsive to said decoding means and successive 
transfer requests from the central processing unit for 
transmitting processed data to the central processing unit, 
including 

i, means responsive to a first selection signal from 
25 the central processing means for enabling said transmitting 
means , and 
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ii. means responsive to said selection means and to 
said successive transfer requests from the central processing 
unit for synchronizing successive transfers of processed data 
to the central processing unit. 

5 2. A special instruction processor as defined in claim 1 
wherein said special instruction processor processes 
instructions in a series of states, said special instruction 
processor further including 

E, means responsive to synchronizing signals from said 
10 central processing unit for receiving signals that direct the 
special instruction process to one of the states in the 
series . 



3. A special instruction processor as defined in claim 1 
wherein said special instruction processor processes 
15 instructions in a series of states, said special instruction, 
processor further including means responsive to synchronizing 
signals from said central processing unit for transmitting 
signals to the central processing unit that identify the 
state of the special instruction processor. 
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FIG 5 



CPU 10 RETRIEVES INSTRaCTION FROM MEMORY,.. 

AND DECODES IT, TESTS INSTRUCTION TO DETERMINE 

IF A FLOATING POINT INSTRUCTION 



IF INSTRUCTION IS A FLOATING POINT 
INSTRUCTION CPU 10 PLACES INSTRUCTION ON 
BUS IB D(7:0) AND ASSERTS IRD STATE SIGNAL 



FPA23 RECEIVES INSTRUCTION AND DECODES 
IT, AND PREPARES TO RECEIVE DATA 
WORDS CONSTITUTING OPERANDS 



CPUlO RETRIEVES A DATA WORD FORMING PART OF 
OPERANDS FROM MEMORY, ASSERTS CPU DATA AVAIL ^ 
SIGNAL IN SUCCESSIVE CLOCK CYCLES AND TRANSMITS 
THE DATA WORDS ON THE BUS Y D{31:00) 



FPA 23 RECEIVES THE DATA WORD 



CPU 10 AND FPA 23 BOTH DETERMINE WHETHER 

ALL DATA WORDS FORMING OPERANDS 

HAVE BEEN TRANSP'ERRED 



YES 



FPA 23 PROCESSES INSTRUCTION 



FPA 23 TRANSMITS ACC SYNC SIGNAL TO CPU 10 



•NO' 



CPUlO PREPARES TO RECEIVE CONDITION CODES AND 
PROCESSED DATA FROM FPA 10 • CPU 10 ASSERTS 

SEL ACC IN SIGNAL AND IN SUCCEEDING ^ 

CLOCK CYCLES THE READ PORT SIGNAL 



FPA23 TRANSMITS THE CONDITION CODES 
AND PROCESSED DATA OVER BUS Y D(31:00) 

\ 

CPU 10 AND FPA 23 BOTH DETERMINE WHETHER ALL 
DATA WORDS HAVE BEEN TRANSFERRED 

YES 



-N0-< 



END 
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